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# 

This  thesis  develops  a computer  algorithm  that  will 
locate  and  identify  the  P wave  in  an  electrocardiogram 
(ECG) . This  program  does  not  assume  that  the  P wave  is 
located  a certain  distance  from  the  R wave  or  other  land- 
marks in  the  ECO.  This  will  allow  the  program  to  locate 
disassociated  P waves. 

I would  like  to  thank  those  who  have  contributed  their 
time  and  guidance  in  this  research.  I give  special  thanks 
to  my  thesis  advisor,  Dr.  Matthew  Kabriskv,  who  proposed 
this  study.  Edwin  L.  Stanley,  M.D. , Staff  of  Cox  Heart 
Institute,  and  M.  Edward  Womble , Ph.D.,  Clinical  Science 
Division,  School  of  Aerospace  Medicine,  gave  valuable 
guidance . 

I also  wish  to  than);  my  wife  Carol,  for  the  support 
she  has  always  given  me. 
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Abstract 

Algorithms  to  locate  and  identify  the  P wave  in  single 
lead  electrocardiograms  (ECG's)  are  developed  and  evaluated 
on  eighteen  12  second  electrocardiogram  recordings.  The 
program  does  not  assume  that  the  P wave  has  any  specific 
time  relationship  to  the  occurrence  of  the  QRS  complex. 

The  program  searches  the  entire  ECG  to  locate  the  P wave 
in  any  part  of  the  ECG.  The  program  divides  the  ECG  into 
sections  and  classifies  the  sections  as  P waves  or  some 
other  wave.  The  first  test  selects  the  sections  that  have 
a frequency  domain  pattern  that  is  similar  to  a training 
set  P wave.  The  sections  that  pass  this  test  are  checked 
for  a periodic  relationship  with  past  selected  sections. 

If  a section  was  selected  at  a time  assumed  to  be  one  P 
wave  period  before  the  present  one,  the  present  one  will  be 
classified  as  a P wave.  Of  the  209  P waves  in  the  ECG 
records,  123  or  58.85%  were  located  correctly.  There  were 
33  false  classifications  of  sections  which  were  not  P waves. y 
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COMPUTER  IDENTIFICATION  OF 
P WAVES  III  ELECTROCARDIOGRAMS 


I . Introduction 

The  purpose  of  this  thesis  is  to  develop  a computer 
program  that  can  identify  the  P wave  in  an  electrocardio- 
gram (ECG)  without  restricting  the  region  of  the  search. 

The  program  will  not  reauire  operator  intervention  to 
initiate  ECG  monitoring. 

Background 

Coronary  Care  Units  (CCU's)  have  been  set  up  in  most 
hospitals  to  care  for  people  who  have  suffered  heart 
attacks.  Up  to  a few  years  ago,  nurses  visual  monitored 
the  ECG's  of  every  patient  in  the  CCU.  Now  computers  are 
beginning  to  become  useful  in  supplementing  continuous 
human  monitoring  of  the  ECG's,  primarily  to  warn  the 
nursing  staff  if  the  patient's  condition  is  deteriorating 
(Refs  7,10). 

The  ECG  is  a graphical  representation  of  the  electrical 
activity  of  the  heart.  A cardiologist  can  learn  a great 
deal  about  the  present  condition  of  the  heart  from  it.  The 
ECG  has  three  important  deflections,  as  shown  in  Figure  1. 
The  P wave  is  caused  by  the  contraction  of  the  atrium  or 
upper  chamber  of  the  heart.  The  QRS  complex  and  the  T 
wave  are  caused  by  the  ventricles  or  lower  chamber  of  the 
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heart.  In  the  ECG  of  a normal  heart,  the  QRS  complex 
occurs  a fixed  time  after  the  P wave.  A signal  that  is  sent 
from  the  atrium  to  the  ventricles  causes  the  ventricles  to 
contract  at  the  proper  time.  This  signal  may  be  delayed 
or  completely  blocked  in  a heart  that  has  suffered  a heart 
attack.  If  the  signal  is  delayed,  the  ECG  will  have  a 
longer  distance  between  the  P wave  and  the  QRS  complex.  If 
the  signal  is  completely  blocked,  the  ECG  will  have  a 
variable  distance  from  the  P wave  to  the  QRS  complex.  The 
ECG  will  resemble  two  unsynchronized  signals,  usually  with 
slightly  different  repetition  rates.  This  type  of  ECG  is 
said  to  have  a disassociated  P wave.  Contemporary  computer 
algorithms  used  to  monitor  the  patients  in  a CCU  accurately 
warn  of  most  dangerous  cardiac  problems;  but  none  of  them 
can  locate  a disassociated  P wave. 

Previous  computer  programs  designed  to  locate  the  P 
wave  have  assumed  that  it  would  be  located  just  before  the 
QRS  complex.  This  restricts  the  search  for  the  P wave  to 
only  this  area  of  the  ECG.  The  programs  are  simple  and 
reasonably  accurate  for  most  cases  but  may  fail  completely 
if  the  ECG  has  a disassociated  P wave.  Lt.  Charles  Pairett 
at  the  Air  Force  Institute  of  Technology  developed  such  a 
program  which  achieved  a 96%  location  rate  for  P waves 
that  were  not  disassociated  (Ref  14) . He  did  not  test  it 
with  any  disassociated  P wave  ECG's;  however,  he  did  state 
that  this  would  cause  the  location  of  erroneous  waves. 

The  program  developed  for  this  thesis  does  not  make  the 
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assumption  that  the  P wave  is  located  between  the  T wave 
and  the  QRS  complex  and  it  searches  all  of  the  ECG. 

Limitations 

The  program  is  designed  to  be  only  a subroutine  of  a 
complete  ECG  monitoring  system.  It  is  assumed  that  the 
monitoring  program  will  supply  the  QRS  to  QRS  interval  to 
the  P wave  locating  subroutine.  The  P wave  locating  pro- 
gram only  classifies  a section  of  the  ECG  as  a P wave  and 
does  not  specify  the  exact  location  of  the  peak  or  the 
beginning  or  end  of  the  P wave. 

Electrocardiogram  Records 

The  ECG  data  used  to  develop  and  test  the  programs 
were  furnished  by  the  USAF  School  of  Aerospace  Medicine. 
They  contain  the  three  lead  electrocardiogram  of  18 
patients  of  the  School  of  Aerospace  Medicine  and  were 
received  as  digitized  records  on  a computer  tape.  The 
three  analog  components  (X,Y,Z)  of  the  ECG  were  sampled 
at  a 512  samples/second  rate  and  digitized  as  16  bit  inte- 
gers which  are  interleaved  (X-^,  Y^,  Z-^,  X^ , Y^  , Z.)f  . . .) 
on  the  tape.  To  prepare  the  data  for  use  in  this  thesis, 
the  X component  data  were  separated  and  placed  on  a com- 
puter file.  The  X component  was  selected  as  the  lead  to 
be  analyzed  because  its  P waves  had  areater  amplitude  than 
the  other  leads.  Appendix  A contains  samples  from  each  of 
the  13  records.  The  computer  programs  used  to  process 
the  data  are  listed  in  Appendix  B. 
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Algorithm  Organization 

The  programs  developed  for  this  thesis  use  the 
Tektronix  1410  Computer  Display  Terminal  operating  in  a 
time  sharing  mode  on  the  CDC  6600  computer.  The  programs 
are  written  in  Fortran  IV  Extended  and  use  subroutines  from 
the  Tektronix  Plot-10  graphics  system  (Ref  15) . 

The  first  program  was  used  to  interactively  select 
sections  of  the  FCG  to  examine  and  to  save  these  sections 
as  a training  set  for  the  feature  selection  process.  The 
program  plots  the  ECG  on  the  screen  of  the  terminal  and 
allows  the  operator  to  select  a part  of  the  display  to 
analyze.  The  operator  can  view  a frequency  domain  repre- 
sentation of  the  selected  section.  The  program  was  also 
used  to  obtain  frequency  domain  data  from  F wave,  T wave 
and  quiescent  sections  of  the  ECG. 

The  second  program  locates  P waves  using  frequency 
domain  and  rhythm  analysis.  It  plots  the  ECG  on  the 
terminal  and  indicates  what  sections  are  classified  as 
P waves.  Appendix  B contains  the  source  code  of  both 


programs . 


II. 


Initialization  Algorithm 


The  initializing  algorithm  is  used  to  collect  and 
analyze  data.  The  product  of  the  analysis  is  a set  of 
constants  used  in  the  classification  process  of  the  program 
to  locate  P waves.  The  data  are  collected  using  the  proto- 
type location  algorithm  and  analyzed  using  the  stepwise 
discriminant  analysis  . 

Pr o to type  Locating  Alg orithm 

The  program  developed  for  this  thesis  allows  the 
operator  to  locate  examples  or  prototypes  of  each  of  the 
classes  of  waveforms  in  the  FCG  and  to  examine  a plot  of 
the  frequency  domain  components.  The  program  can  also 
save  these  components  so  they  may  be  used  for  feature 
selection.  The  program  is  interactive  and  prompts  the 
operator  at  each  step  of  the  data  collection  process.  The 
flowchart  for  this  program  is  shown  in  Figure  2. 

Select  Mode.  At  the  start  of  the  program,  the  operator 
selects  how  the  program  will  be  used.  The  program  is  set 
to  process  the  sections  in  one  of  three  modes. 

1.  Plot  the  complex  components  of  the  Fast  Fourier 

Transform  (FFT) . 

2.  Plot  the  discrete  power  spectrum. 

3.  Output  the  FFT  and  the  discrete  power  spectrum. 

The  first  two  modes  are  used  by  the  operator  to  learn  about 
the  frequency  domain  characteristics  of  the  data. 
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Tne  number  of  sample  points  contained  in  the  prototype 
is  selected  at  the  start  of  the  program  and  must  be  the 
same  as  used  in  the  program  to  locate  P waves.  A prototype 
size  of  32  samples  was  used  for  the  thesis  because  this  is 
the  approximate  size  of  a typical  P wave . The  prototype 
may  be  set  as  large  as  64  samples. 

Read  and  Normalize  1024  Samples.  The  program  reads 
1024  samples  from  a computer  file  and  places  them  in  an 
array.  The  arithmetic  mean  of  the  samples  is  calculated 
and  subtracted  from  each  sample.  The  standard  deviation  of 
the  samples  is  found  and  is  divided  into  each  sample.  This 
produces  an  array  with  a mean  of  zero  and  a standard 
deviation  of  one.  It  also  removes  some  of  the  most  common 
nonphysiological  variations  from  the  ECG.  The  ECG  ampli- 
fier gain  and  the  electrode  placement  and  resistance  can 
cause  the  peak  to  peak  amplitude  of  the  ECG  to  vary.  These 
variations  can  be  minimized  by  dividing  each  sample  by 
the  standard  deviation  of  the  samples.  Vertical  shifts  in 
the  ECG  can  be  caused  by  movements  of  the  patient.  This 
can  be  minimized  by  normalizing  the  mean  of  the  data.  The 
program  to  locate  P waves  uses  the  same  normalizing  process. 

Plot  ECG.  The  ECG  is  plotted  on  the  terminal  screen 
with  a scale  that  indicates  sample  location.  Figure  3 shows 
the  terminal  display  of  the  prototype  locating  program. 

Select  Section  as  Prototype.  After  the  ECG  is  plotted 
on  the  terminal  screen,  a section  of  the  ECG  can  be  selected 
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by  the  operator  to  examine  or  save.  To  select  a section, 
the  starting  points  sample  location  is  entered  on  the  key- 
board of  the  terminal.  The  program  then  draws  two  vertical 
lines  that  delimit  the  prototype.  If  the  placement  of  the 
boundaries  is  not  correct,  the  location  can  be  moved. 

When  it  is  placed  correctly,  the  selected  section  will  be 
processed  according  to  the  mode  selected  at  the  start  of 
the  program. 

Process  the  Prototype.  The  samples  of  the  selected 
section  are  placed  into  the  first  32  real  elements  of  a 64 
element  complex  array  and  then  the  rest  of  the  elements  are 
set  to  zero.  A 64  point  FFT  is  performed  on  this  array. 

The  FFT  subroutine  operates  only  on  arrays  which  have 
dimensions  that  are  powers  of  two  (Ref  13) . A 64  sample 
FFT  was  used  to  make  the  FFT  components  frequency  incre- 
ments smaller  and  also  to  allow  the  prototype  size  to  be 
greater  than  32  samples.  Only  the  eight  low  frequency 
components  of  the  FFT  were  used  in  the  program  because 
they  contain  most  of  the  energy  of  the  FCG. 

If  the  program  is  set  to  display  the  frequency  domain 
data,  the  value  of  the  component  is  plotted  along  the 
abscissa  and  the  components  are  the  ordinate.  If  the  FFT 
components  are  plotted,  the  order  of  the  components  is  one 
to  eight  of  the  real  and  then  two  to  nine  of  the  imaginary 
as  shown  in  Figure  4.  If  the  discrete  power  spectrum  is 
plotted,  the  order  is  from  one  to  eight.  The  discrete 
power  spectrum  is  calculated  from  the  components  of  the  FFT. 
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Each  component  is  equal  to  the  square  root  of  the  sura  of 
the  square  of  the  real  and  imaginary  terms  of  the  FFT . 

There  are  interactive  commands  that  allow  the  expansion  of 
the  amplitude  of  the  whole  plot  or  each  components  ampli- 
tude can  be  changed. 

If  the  program  is  set  to  collect  prototypes  to  be 
used  in  feature  selection,  the  FFT  components  and  the 
discrete  power  soectrum  are  placed  on  a computer  file.  The 

components  are  selected  and  calculated  as  described  above.  ; 

Each  selected  section  of  the  FCG  could  be  represented  as 
a data  vector  in  23  space  using  these  components. 

Stepwise  Discriminant  Analysis 

The  stepwise  discriminant  analysis  is  used  to  select 
the  eight  components  of  the  frequency  domain  data  which 
best  separates  the  training  set  P waves  from  the  other 
waveforms  in  the  training  set.  The  training  set  was 
formed  from  four  ECG 1 s using  the  program  described  earlier. 

The  components  selected  by  the  stepwise  discriminant 
analysis  were  used  by  the  program  to  locate  P waves  for 
all  the  ECG's.  This  allowed  the  program  to  locate  P waves 
in  a new  ECG  without  performing  feature  selection. 

\ 

The  training  set  was  divided  into  three  groups:  l 

P wave,  baseline,  and  T wave . The  P wave  class  contained  , 

1 

only  P waves.  Any  waveform  that  was  not  a P wave  and  had 
less  energy  than  a P wave  was  placed  in  the  baseline  class. 

The  T wave  class  contained  T waves  and  also  QRS  waveforms 
and  premature  ventricular  contractions  (PVC's). 
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The  stepwise  discriminant  analysis  program  used  for 
this  thesis  is  called  BMD07M  and  was  developed  by  UCLA 
(Ref  5) . The  program  performs  a multiple  discriminant 
analysis  in  a stepwise  manner.  At  each  step  the  program 
adds  or  deletes  a component  to  improve  the  separation 
between  groups.  When  the  most  important  components  have 
been  selected,  the  canonical  variables  are  formed  using 
these  components.  The  canonical  variables  map  the  fre- 
quency domain  data  from  23  space  into  the  decision  space. 

For  each  of  the  canonical  variables,  the  program 
calculates  a weight  for  each  of  the  selected  components. 

A canonical  variable  is  equal  to  the  summation  of  each 
component  multiplied  by  its  weight  plus  a constant  to  posi- 
tion the  variable.  A three  class  problem  requires  a two 
dimensional  decision  space;  so  two  canonical  variables  are 
created  to  define  the  X and  Y axes.  The  group  mean  is 
calculated  by  evaluating  the  canonical  variables  using  the 
mean  value  of  each  of  the  components  of  the  group. 

The  variables  can  be  used  to  map  new  data  into  one  of 
the  three  classes.  The  frequency  domain  components  of  a 
section  of  an  ECG  will  represent  a point  in  this  decision 
space.  If  the  three  groups  had  equal  a priori  probabilities 
and  using  a minimum  probability  of  error  decision  criterion, 
the  wave  should  be  classified  into  the  group  whose  mean  is 
closest  to  the  waveform  in  the  decision  space. 

The  program  provides  statistical  information  about 
the  training  set  and  outputs  the  constants  for  the  canonical 
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variables  and  values  of  the  three  group  means.  It  also 
plots  the  classification  space  with  the  group  means  and  the 
training  set  points  as  shown  in  Figure  5.  The  constants  of 
the  two  canonical  variables  and  the  three  group  means  are 
inputs  to  the  program  to  locate  P waves . 
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Figure  5.  BMDO'/’s  Plot  of  the  Decision  Space 
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Location  Algorithm 


The  program  to  identify  P waves  plots  the  ECG  on  the 
terminal  screen,  divides  it  into  sections  and  classifies 
each  section.  Two  tests  are  used  to  select  the  sections 
to  be  classified  as  P w aves.  The  first  test  selects  the 
sections  that  have  a frequency  domain  pattern  similar  to 
the  training  set  P waves.  The  sections  that  pass  this  test 
will  be  called  candidates  and  they  are  tested  for  a peri- 
odic relationship.  If  a candidate  was  selected  at  a time 
assumed  to  be  one  P wave  period  before  the  present  candi- 
date, the  one  under  test  will  be  classified  as  a P wave. 

The  flowchart  for  this  program  is  shown  in  Figure  6. 

Initialize  Program 

The  values  of  the  constants  of  the  two  canonical 
variables  and  the  group  means  for  the  P wave,  T wave  and 
baseline  classes  are  read  from  a computer  file.  The  value 
of  the  constants  in  the  canonical  variables  for  the  compo- 
nents that  were  not  selected  in  the  feature  selection  is 
zero.  The  constants  for  the  selected  components  are  equal 
to  the  values  assigned  by  the  BMD07M  program. 

Read and  No rmalize  1024  Samples 

An  array  is  filled  with  1024  samples  from  a computer 
file  containing  the  ECG  data.  The  array  is  normalized  by 
subtracting  from  each  element  the  mean  value  of  the  array. 
The  standard  deviation  of  the  array  is  found  and  each 
element  is  divided  by  it.  This  removes  some  of  the 


variation  in  the  signal  that  is  caused  by  nonphysiological 
sources.  This  is  the  same  normalization  used  in  the  pro- 
gram to  collect  data  for  the  feature  selection. 

Plot  Electrocardiogram 

The  1024  samples  of  the  normalized  array  are  plotted 
on  the  terminal  screen  as  shown  in  Figure  7.  Except  for 
the  first  and  last  128  samples,  the  display  is  divided  into 
overlaping  sections  which  are  32  samples  wide  and  overlap 
16  samples.  The  exclusion  of  the  first  and  last  123 
samples  is  necessary  because  the  FCG  displays  overlap. 

Calculate  the  FFT  of  Section 

mhe  32  samples  of  the  section  are  placed  into  a 64 
element  complex  array  and  the  FFT  is  performed  on  it.  As 
in  the  first  program,  the  low  frequency  FFT  components  are 
selected  and  the  discrete  power  spectrum  is  created  from 
the  magnitude  of  the  FFT  components. 

Plot  Indicator  of  Croup  Mean 
heares  t the  Canon ical  Vari able 
Location 

The  two  canonical  variables  are  calculated  by  summing 
the  product  cf  each  component  found  in  the  last  step  and 
its  constant  selected  by  the  initialization  algorithm.  The 
two  variables  represent  a point  in  the  two  dimensional 
decision  space.  The  euclidean  distance  from  this  point  to 
each  of  the  group  means  found  in  the  initialization 
algorithm  is  calculated. 
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A vertical  line  is  drawn  on  the  terminal  screen  to 


indicate  the  starting  location  of  the  section  and  a hori- 
zontal line  is  drawn  to  indicate  which  group  mean  was 
nearest  the  location  of  the  canonical  variables.  The 
baseline  indicator  is  the  lowest,  the  T wave  is  above  the 
baseline,  and  the  P wave  is  the  highest.  Some  waveforms 
that  are  not  P waves  will  have  decision  space  locations 
that  are  closer  to  the  P wave  group  mean  but  at  a distance 
greater  than  any  of  the  P waves  in  the  training  set.  All 
the  P waves  in  the  training  set  mapped  into  a cluster  less 
than  three  units  from  the  group  mean.  If  the  P wave  group 
mean  is  the  closest  but  the  distance  is  more  than  four 
units,  it  will  not  be  selected  as  a candidate  P wave  and 
will  be  indicated  by  a height  that  is  between  that  of  the 
P wave  and  the  T wave.  These  indicators  are  shown  in 
Figure  7. 

Period  Test 

If  the  section  was  selected  as  a candidate  P wave,  a 
second  test  will  check  for  the  periodic  occurrence  of  the 
candidates.  The  P wave,  which  is  caused  by  the  contraction 
of  the  atrium,  is  normally  periodic.  The  atrium's  con- 
tractions are  rhythmic  because  it  is  controlled  by  the 
sinoatrial  node  in  the  heart.  This  will  occur  even  if 
there  is  damage  to  the  heart  which  causes  a disassociated 
P wave  in  the  FCG.  The  period  test  will  classify  the 
section  as  a P wave  if  a candidate  was  selected  at  a time 
assumed  to  lie  one  period  before  the  present  one. 
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Th 


_me  assumed  to  be  the  P wave  period  is  found  by 


examining  a histogram  of  candidate  occurrences  in  which 
each  entry  is  equal  to  the  number  of  previous  candidates 
that  were  separated  by  that  number  of  display  sections. 

The  histogram  locations  that  represent  possible  P wave 
repetition  rates  are  searched  for  the  maximum  value.  The 
R wave  period,  which  is  easier  to  measure  than  the  P wave 
period,  is  used  to  bound  the  region  of  search.  The  period 
of  the  P wave  could  be  from  1.1  to  .5  times  the  period  of 
the  R wave.  The  period  test  is  implemented  by  using  two 
arrays . 

1.  Array  PICKED  contains  the  history  of  which  of  the 
last  50  display  sections  were  selected  as  candidates. 

2.  Array  HIST  contains  the  histogram  of  candidate 
occurrence  for  distances  from  one  to  50  display 
sections . 

The  array  PICKED  contains  a flag,  the  value  one,  to 
indicate  when  a section  was  selected  as  a candidate. 
Otherwise  it  is  blank  with  the  value  zero.  The  subscript 
of  the  element  indicates  the  location  of  the  display  section 
that  it  represents.  The  first  element  represents  the 
present  section  and  the  second  element  represents  the 
previous  section.  When  the  program  goes  to  the  next  sec- 
tion, it  shifts  the  array  to  the  right.  The  flags  are  now 
in  elements  with  subscripts  that  are  one  greater.  The 
last  element,  which  represents  the  section  51  units  away, 
will  be  lost. 
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The  array  HIST  is  used  to  find  the  distance  between 
candidates  that  occurs  most  often  and  this  is  assumed  to  be 
the  P wave  PERIOD.  Each  time  that  a candidate  is  selected, 
the  array  PICKP^D  is  added  to  the  array  HIST  element  by 
element.  The  array  HIST  is  then  searched  over  the  range  of 
possible  P wave  periods.  The  R wave  period,  which  is  used 
to  define  the  range  of  search,  is  entered  on  the  terminal 
after  viewing  the  first  display  of  the  ECG.  In  an  opera- 
tional monitoring  program,  this  information  would  be  obtained 
from  the  main  program.  When  the  largest  value  is  found  in 
the  search  region  of  array  HIST,  the  subscript  is  used  as 
the  center  of  the  test  window  for  the  array  PICKED.  If  a 
flag  is  found  in  three  sample  wide  test  window,  the  section 
is  classified  as  a P wave.  Figure  3 shows  the  arrays  HIST 
and  PICKED  as  histograms  with  the  search  region  and  the 
test  window  shown.  To  indicate  that  a section  has  been 
classified  as  a P wave,  a vertical  line  is  drawn  on  the 
screen  through  the  center  of  the  section  as  shown  in  Fig- 
ure 7.  After  all  the  sections  in  the  display  have  been 
analyzed  and  the  results  indicated  on  the  terminal,  each 
section's  distance  from  the  P wave  group  mean  is  printed. 

The  sections  that  were  not  selected  as  candidates  are 
represented  by  zeros . 
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igure  8.  Histogram  of  Candidate  Occurrence 


The  results  will  be  discussed  in  two  sections. 


F i r s t 


the  initialization  algorithm  results  describe  the  selection 
of  the  training  set,  the  analysis  procedure  and  the  product 
of  the  analysis.  Then  the  results  of  the  location  algo- 
rithm describes  the  performance  of  the  P wave  locating 
program  using  the  18  ECG  records. 

Tnlti a li cation  Algorithm  Pesults 

A training  set  was  selected  and  then  analyzed  to  find 
the  important  features  to  be  used  to  select  candidates  in 
the  program  to  locate  P waves.  The  training  set  prototypes 
were  all  obtained  from  the  ECG  records  one  to  four.  Several 
prototypes  were  produced  from  each  P wave  with  it  positioned 
differently  in  each  prototype.  Its  position  in  the  proto- 
type window  was  shifted  eight  samples  from  the  last 
prototype.  This  continues  as  long  as  the  prototype  contains 
at  least  30%  of  the  energy  of  the  wave.  This  provides  two 
to  three  prototypes  from  each  P wave  and  a total  of  113 
P wave  prototypes.  The  peak  of  each  of  the  T waves  was 
selected  as  part  of  the  training  set.  The  T wave  group, 
which  also  included  four  PVC ' s and  five  ORS  complexes, 
contained  a total  of  69  prototypes.  The  baseline  class 
contained  64  randomly  selected  prototypes  from  sections  of 
the  ECG  which  contain  little  energy. 

The  BMD07M  discriminant  analysis  program  was  used  to 
analyze  the  training  set  and  produce  the  two  canonical 


variables  used  to  select  the  candidates  in  the  P wave 
location  program.  The  BMD07M  program  plots  the  location 
of  the  group  means  and  all  the  prototypes  in  the  training 
set  in  the  decision  space.  The  program  also  produces  a 
table  that  shows  how  the  prototypes  of  each  of  the  groups 
will  be  classified  using  the  canonical  variable  as  a deci- 
sion rule.  For  example,  it  states  how  many  of  the  P wave 
prototypes  will  be  classified  as  members  of  the  P wave , 

T wave  and  baseline  groups.  Both  of  these  outputs  were 
used  to  evaluate  the  separation  of  the  groups  and  the 
reliability  of  the  decision  rule.  The  separation  of  the 
T wave  and  baseline  groups  was  not  considered  important. 

The  goal  in  feature  selection  was  to  maximise  the  number 
of  P wave  prototypes  classified  as  P waves  and  to  minimize 
the  number  of  other  group  prototypes  that  are  classified  as 
P waves. 

The  membership  of  the  T wave  and  baseline  groups  was 
manipulated  to  improve  the  separation  of  the  P wave  group 
from  the  other  groups.  Initially,  the  three  groups  were 
analyzed  by  the  program  and  the  decision  space  plot  showed 
that  the  P wave  and  baseline  groups  formed  tighter  clusters 
around  their  means  than  the  T wave  group.  Twenty-five  of 
the  outer  T wave  prototypes  were  separated  into  a fourth 
class,  called  OH'1’,  which  was  not  used  in  any  of  the  feature 
selection  calculations  but  w as  plotted  in  the  decision 
space  to  ensure  that  they  continued  to  be  located  away  from 
the  P wave  mean.  The  prototypes  placed  in  the  group  OUT 


included  all  the  QRS  complexes.  The  first  decision  space 


plot  showed  that  some  of  the  T wave  prototypes  were  closer  ' 

to  the  baseline  group  mean  than  the  T waves;  so  they  were 
transferred  to  that  group. 

The  initial  BMD07M  program  analysis  classified  30%  of 
the  P wave  prototypes  as  P waves  and  15%  of  the  other  two 

classes  prototypes  as  P waves.  After  rearranging  the  | 

prototypes,  the  results  improved  to  86%  correct  P wave 

classification  and  only  10%  of  the  other  three  groups  \ 

prototypes  were  classified  as  P waves.  The  finial  decision 

space  plot  is  shown  in  Figure  9.  The  symbols  P,  B,  T and  0 ; 

represent  prototypes  of  the  groups  P wave,  baseline,  T wave 

and  OUT,  respectively.  The  * indicates  the  location  of  a 

group  mean.  The  $ indicates  that  two  or  more  prototypes 

are  located  at  that  point.  The  eight  components  selected 

as  the  most  important  are  the  FFT  real  components  two, 

three,  four  and  five.  Also  the  power  spectrum  components 

two,  three,  seven  and  eight  were  selected.  None  of  the 

imaginary  components  of  the  FFT  were  selected.  The  values 

of  the  constants  selected  for  the  canonical  variables  are 

given  in  Table  I.  The  three  group  means  are  listed  in  j 

Table  II.  I 

Location  Algorithm  Results 

The  P wave  locating  program  was  evaluated  using  the  , 

18  FCG  records.  The  performance  of  the  candidate  selection 

and  the  total  algorithm  is  discussed  separately.  The  data  j 
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igure  S.  Finiai  Decision  Space  Plot 


Table  I 


Constants  of  the  Canonical 
Variables 


Table  IT 
Croup  Means 


Group 

X 

Y 

P Wave 

-.61434 

-.75516 

Baseline 

1.35221 

.42375 

T Wave 

-2.10539 

1.61141 

Table  III 


Candidate  Selection 
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does  not  contain  any  true  disassociated  P wave  ECG's  but 
some  of  the  F waves  are  not  followed  by  QRS  complex  at  the 
normal  distance.  These  are  caused  by  PVC  activity  or 
partial  AV  node  blockage. 

The  candidate  selection  performance  is  listed  in 
Table  III.  The  P wave  is  scored  as  selected  if  a section 
containing  more  than  601  of  the  P wave  is  selected  as  a 
candidate.  The  P wave  selection  rate  is  31%  which  is 
approximately  the  rate  that  the  3MD07M  program  achieved  on 
the  training  set. 

The  total  algorithm  performance  is  given  in  Table  IV. 

If  two  adjacent  sections  are  classified  as  P waves  and  they 
share  a common  waveform  peak,  like  a P wave  or  a T wave,  they 
are  counted  as  one  classification.  The  program  correctly 
located  58.85%  of  the  P waves  and  had  approximately  one 
false  classification  every  two  and  one  half  P wave  cycles. 
Figure  10  shows  a display  from  ECG  record  13  in  which  the 
candidate  selection  failed  and  classified  the  P v;aves  as 
T v/aves.  Figure  11  shows  a display  from  ECG  record  5 in 
which  the  P v/aves  were  classified  as  member  of  the  baseline 
group.  Figure  12  shov/s  a display  from  ECG  record  3 where 
the  period  test  failed  a true  P wave  because  a PVC  covered 
the  previous  P wave.  Figures  13  and  14  show  displays  from 
ECG  records  13  and  14  respectively  in  which  the  P wave  is 
located  but  there  is  no  QRS  complex  following  it.  These 
show  how  the  algorithm  could  operate  on  an  ECG  with  a 
disassociated  P wave.  Figure  13  shows  the  location  of  a 
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Table  IV 

Total  Classification  Results 


ECG 

Record 

Actual 

P 

Waves 

P Waves 
Located 

Class 

Incorrectly 
ified  as  a P Wave 

T Waves 

Other 

Waveforms 

Total 

1 

11 

2 

0 

10 

10 

2 

13 

0 

3 

0 

3 

3 

12 

9 

2 

3 

5 

4 

9 

5 

2 

1 

3 

5 

8 

0 

3 

1 

4 

6 

17 

11 

6 

0 

6 

7 

9 

3 

1 

0 

1 

3 

16 

15 

7 

4 

11 

9 

12 

9 

3 

0 

3 

10 

10 

7 

0 

2 

2 

11 

12 

8 

2 

0 

2 

12 

10 

2 

3 

1 

4 

13 

13 

5 

3 

0 

3 

14 

10 

8 

2 

6 

3 

15 

10 

10 

0 

0 

0 

16 

13 

1 

11 

0 

11 

17 

12 

10 

2 

0 

2 

13 

12 

9 

4 

1 

5 

Total 

209 

123(53.85%) 

54 

29 

33 
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Figure  ii.  Analysis:  ECC-  Record 


Analysis:  ECG  Record 
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Record  14 


P wave  that  occurs  during  a PVC.  The  PVC  prevents  the  " 

normal  occurrence  of  the  ORS  complex  following  the  P wave.  * 

Figure  14  also  shows  the  location  of  a P wave  during  PVC 
activity.  The  shape  of  the  second  P wave  is  distorted  by 
the  depolarization  of  the  heart  following  the  PVC. 
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V.  Conclusions  and  noconrnndations 

Conclus ions 

The  algorithm  is  not  reliable  enough  to  provide  useful 
F wave  location  information.  The  selection  of  a training 
set  to  be  used  to  initialize  the  program  for  all  ECG ' s is 
not  a reliable  method  of  training  the  classifier.  The 
histogram  period  test  is  reliable  where  the  P wave  is  not 
covered  by  a high  energy  wave. 

Recommendations 

The  program  can  be  improved  in  two  ways:  individual 
initialization  and  extended  period  test.  One  problem  is 
the  extreme  variation  of  the  P v/ave  shape  between  patients 
and  the  similarity  of  some  P waves  and  T waves.  This  is 
demonstrated  by  the  results  of  ECG  records  1,  2,  12  and  1G. 
The  program  located  less  than  20%  of  the  P waves  in  these 
records.  This  problem  could  be  solved  by  training  the 
candidate  selection  using  the  initialization  algorithm  with 
data  from  only  the  ECG  to  be  monitored  (Ref  4) . Using  the 
algorithm  in  this  thesis  would  require  the  operator  to 
select  P waves  from  the  ECG  to  initialize  the  program. 

The  period  test  does  not  classify  a P wave  correctly 
if  the  previous  P wave  was  not  located  because  it  was 
covered  by  a PVC  or  other  waveform.  The  program  must  be 
able  to  locate  the  P wave  under  these  conditions  to  be 
reliable  on  disassociated  P v/ave  ECG ' s . There  are  two 
methods  to  improve  the  period  test.  The  candidates  could 
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be  retroactively  classified  if  they  are  followed  by  a 
candidate  at  the  proper  time.  The  second  way  would  be  to 
have  an  additional  test  that  looked  back  two  or  more  P wave 


periods  to  check  for  candidates  in  a window. 
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